Rank | Count | Beginning |
---|---|---|
8227 | 2750 | Η |
15787 | 2187 | Ο |
26926 | 1529 | Το |
16680 | 1155 | Οι |
25634 | 465 | Τα |
4100 | 451 | Δεν |
24908 | 440 | Σύμφωνα |
12093 | 437 | Και |
13764 | 435 | Με |
22457 | 432 | Σε |
24133 | 428 | Στο |
3340 | 416 | Για |
1678 | 385 | Από |
19317 | 374 | Οπως |
5549 | 334 | Είναι |
23483 | 331 | Στην |
26386 | 329 | Την |
2632 | 259 | Αυτό |
23403 | 227 | Στη |
8303 | 202 | «Η |
11477 | 200 | Θα |
20057 | 187 | Όταν |
899 | 186 | Αν |
14763 | 184 | Μία |
14189 | 178 | Μετά |
29780 | 151 | Ωστόσο, |
15339 | 148 | Να |
599 | 142 | Αλλά |
6493 | 140 | Ένα |
23987 | 136 | Στις |
In the next four subsections show the most frequent sentence beginnings consisting of N words, N=1, 2, 3, 4. In this subsection we start with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
Especially for N=1, we only need a small corpus to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV